Visibility Culling per Cache Block with Tiling-Traversal Algorithm

نویسندگان

  • Moon-Hee Choi
  • Woo-Chan Park
چکیده

As many applications in computer graphics require render high complex 3D scenes at interactive rates, the search for an effective visibility culling method has become one of the most important issues to be addressed in the design of 3D rendering processors. In this paper, we proposed a new rasterization pipeline with visibility culling; the proposed architecture performs the visibility culling at an early stage of the rasterization pipeline by retrieving data in a pixel cache without any significant hardware logic such as hierarchical z-buffer. When the proposed visibility culling method is performed, cache block misses can occur. For reducing this occurring rate, we fetch next pixel cache blocks previously, which can be predicted by tiling-traversal algorithm. Simulation results show that the proposed architecture can achieve a performance gain of about 41% compared with the conventional pre-texturing architecture, and about 17% compare to the hierarchical z-buffer visibility scheme. Keyword: 3D graphics, graphics hardware, rendering processor, visibility culling, and pixel cache

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tiling, Block Data Layout, and Memory Hierarchy Performance

Recently, several experimental studies have been conducted on block data layout in conjunction with tiling as a data transformation technique to improve cache performance. In this paper, we analyze cache and TLB performance of such alternate layouts (including block data layout and Morton layout) when used in conjunction with tiling. We derive a tight lower bound on TLB performance for standard...

متن کامل

Compiler Optimizations for Cache Locality and Coherence

Almost every modern processor is designed with a memory hierarchy organized into several levels, each of which is smaller, faster, and more expensive than the level below. High performance requires the eeective use of the cached data, i.e. cache locality. Smart compiler transformations can relieve the programmer from hand-optimizing for the speciic machine architectures. In a multiprocessor sys...

متن کامل

Performance Evaluation of Global Sequence Alignment Algorithm on Multicore Architectures With Reference to Cache

Several experimental studies have been conducted over last decade on block data array in conjunction with tiling as a data transformation technique to improve cache performance. Based on the tile size and cache performance analysis, we propose a new data block size selection method – here we call it as a buffer size selection, which tightly fits into cache to get optimal solution for wave front...

متن کامل

Conservative From-Point Visibility

Visibility determination has been an important part of the computer graphics research for several decades. First studies of the visibility were hidden line removal algorithms, and later hidden surface removal algorithms. Today’s visibility determination is mainly concentrated on conservative, object level visibility determination techniques. Conservative methods are used to accelerate the rende...

متن کامل

Layered Point Clouds

We present a simple point-based multiresolution structure for interactive visualization of very large point sampled models on consumer graphics platforms. The structure is based on a hierarchy of precomputed object-space point clouds. At rendering time, the clouds are combined coarse-to-fine with a top-down structure traversal to locally adapt sample densities according to the projected size in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004